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Abstract 

We consider five different peer-to-peer file sharing systems with two chunks, with the aim 
of finding chunk selection algorithms that have provably stable performance with any input 
rate and assuming non-altruistic peers who leave the system immediately after downloading 
the second chunk. We show that many algorithms that first looked promising lead to unstable 
or oscillating behavior. However, we end up with a system with desirable properties. Most of 
our rigorous results concern the corresponding deterministic large system limits, but in two 
simplest cases we provide proofs for the stochastic systems also. 

1 Introduction 

We consider an open network, with constant rate of incoming 'peers' . A peer is assumed to be able 
to contact and communicate with any other peer in the system (technically, this can be realised by 
an overlay network built upon the Internet, where the knowledge of a peer's IP-address enables 
communication with it). We assume that one special, persistent peer, the 'seed', holds a file and 
wishes to distribute it to all the others. The most effective way of doing this, in particular when 
the number of peers is very large, is that as soon as a peer receives the file, it becomes a seed 
itself. The number of copies of the file then grow exponentially. To enhance performance, the file 
is divided into small chunks that are spread in similar fashion so that parts of the file may start to 
be multiplied before the original seed has even once uploaded the whole file. This technique was 
introduced by B. Cohen with his BitTorrent [1] protocol. It became soon the dominant principle 
of sharing large files (e.g., movies) with peer-to-peer networking. 

Moreover, the peers can be 'non-altruistic' in the sense that they leave the system immediately 
having downloaded the whole file, without necessarily slowing the system performance. It is 
remarkable that if there are more than one chunks, it seems that any arrival rate of new peers can 
still be sustained. However, as we shall see, some additional algorithms are then needed for stable 
performance. This paper analyses several such algorithms in the simplest relevant case of two 
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chunks. In real systems the number of chunks is large, e.g. of the order 10 3 . However, even the 
case of two chunks, considered here, is highly non-trivial. In fact, it may also be the hardest case 
as regards stability. For proving stability it is anyway the easiest case, and hopefully the solutions 
for the two chunk case appear useful in more general models as well. 

We work on a fully distributed scenario, relying on randomness: each peer contacts another, 
uniformly randomly chosen peer, according to a standard Poisson process, and gets to know what 
chunks the latter possesses. What follows, depends on the particular algorithm. Massoulie and 
Vojnovic [7] were the first to propose this model and to obtain rigorous mathematical results on it. 
They allowed an arbitrary number of chunks and analysed the corresponding deterministic large 
system limit (see Section 2) with the following remarkable result: if each new peer arriving in the 
system obtains a roughly uniformly random chunk, the system is stable even if all the remaining 
chunks are downloaded randomly. (In the special case of two chunks, it is noted in [7] that the 
scheme does not give a unique rest point in the large system limit, but for any larger number of 
chunks it does.) However, the results of [7] prompt for further research in at least two directions. 
First, the scheme where the seed gives a uniformly distributed chunk to every new peer makes the 
seed a potential bottleneck — thus, this algorithm is not as fully distributed as it could be. Second, 
the stability of the large system limit does not automatically guarantee the stability of the original 
random system. 

We focus on entirely distributed solutions, where also the first chunk must be found randomly 
from the peer population, and assume 'non-altruistic' peers. In our first paper [8] we considered 
the so-called flash-crowd scenario where a large number of peers arrive simultaneously but none 
afterwards. It was noticed that the first phase of the copying process is asymptotically (with 
increasing number of peers) equivalent to Polya's urn model, which is well known to converge 
to a random proportion of each chunk in the system. This imbalance leads to the 'rare chunk 
phenomenon': one of the chunks is not able to become common and as a result forms a bottleneck 
of performance (see also [10]). In an open variant of this setup, with continuously incoming 
peers, this could lead to instability: the number of peers in the system could grow unboundedly, 
since more and more peers would be searching for the one rare chunk. (BitTorrent counteracts 
to the rare chunk phenomenon by its 'Rarest First' principle, and our last two algorithms can 
be seen as distributed ways to implement this principle using very coarse rarity estimates.) The 
central question of this paper is: is it possible to avoid the severely imbalanced chunk distribution, 
implying instability, without using centralized coordination of downloads? 

One source of ideas for this is provided by the wide literature on urn models (originally often 
related to physics; for a recent review, see [9]). For example, Ehrenfests' urn model gives an 
almost ideal balance in a closed system. Still more relevant is the so-called Friedman urn, with 
an analogous result for an open system, with a flow of incoming particles. We showed in [8] that 
if an empty node first contacts a node having chunk (1) but then downloads the opposite chunk 
1 (0) first (neglecting for a while the question how a peer with that chunk could be found), the 
distribution of chunks converges almost surely to | — \ as the number of peers goes to infinity. 

In this paper, we analyse five two-chunk models: (i) the Plain Random Contact system, which is 
found to be very unstable; (ii) the Deterministic First Chunk system, proposed in [8], but found 
unstable in the present scenario; (iii) the ideal Friedman system (non-implementable in a dis- 
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tributed way in our scenario), which is proven to be stable; and two distributed algorithms that 
try to approximate the Friedman system: (iv) the Delayed Friedman system, which may be stable 
but oscillates heavily, and, finally, (v) the Enforced Friedman system which seems to provide the 
desired performance. 

The paper is structured as follows: general definitions and some preliminaries are given in Section 
2, and the five models are studied in Section 3, each in its own subsection. Some concluding 
remarks are made in Section 4. 



2 Definitions and preliminaries 

We study time-homogeneous continuous time Markov processes S = (S t )t>o with state space 
N d , where d is 2,3 or 4, depending on the particular model. Denoting the ith unit vector by e i5 
% — 1, . . . , d, the transitions are always of one of the three forms 

S ^ S I (3^ . S ^ S j S ^ S | 63^ & j • 

Denote the transition intensity from state m to state n by q(m, n). The process S is thought as 
a model of a queueing network, where the state component i presents the number of customers in 
network node i, i — 1, . . . , d. 

The Markov process S is called stable, if it is irreducible and positively recurrent. This is equiv- 
alent to the existence of a unique stationary probability measure. Assuming irreducibility and 
finiteness of transition graph neighborhoods, stability is equivalent to the existence of a finite set 
of states CcN d such that with any starting point S , the process reaches C in a time with finite 
expectation. 

Let F : (0, oo) d — > W 1 be a Lipschitz continuous function so that the autonomous ordinary 
differential equation 

s = F(s) (1) 

has a solution s(t), t e [0, T), T e (0, oo], for every starting point s(0) G (0, oo) d . We say that 
the dynamical system (1) is the large system limit of the Markov process S, if 

F i (x)=\im Q L([Nx\,[Nx\+e i ) + ^ ^ Vi q( [Nx\ , [Nx\ + v) J, i = l,...,d, 

y «0{ei,...,e d } / 

where [yj denotes the largest integer less than or equal to y and is defined componentwise for 
vectors, and v = (i>i, . . . , va) runs over the different possible transition vectors. The idea is to 
scale the arrival rates to the system (transitions v = e^) as well as the states by N, divide by 
N and take the limit. Thus, we assume the internal transition and exit rates to be linear in iV 
as functions of the state. Conditions of limit theorems showing the convergence of the Markov 
process towards a deterministic limit system in such scaling have been established by Kurtz [5]. 
This type of results, however, tell nothing about the stability of a stochastic system with finite N, 
and therefore we don't review them closer here. 
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The dynamical system (1) is called locally asymptotically stable around an equilibrium state s* 
(that is, a state with F(s*) = 0), if there exists an open set U containing s* such that lim^oo s(t) = 
s* for any initial state s(0) G U. The system is called globally asymptotically stable, if it has a 
unique equilibrium state s*, such that lim^^ s(t) = s* for any initial state s(0) G (0, oo) d . 
Finally, we call the system (1) (globally) stable, if there is a compact set K C (0, oo) d such that 
for any initial state s(0) G (0, oo)^ the system reaches if and eventually stays in K. 

The stability of a large system limit is not known to be sufficient nor necessary for the stability of 
the original Markov process. We are not aware of any rigorous results concerning this question, 
but the following remarks illuminate the difficulties in relating the two notions. First, assume that 
the large system limit exists and is globally asymptotically stable. When d > 3, the trajectories 
can however be very complicated and the convergence toward equilibrium very slow. A stochastic 
system, how well ever fitted to the continuous state space, evolves in jumps and does not follow 
any trajectory — only its local drift is in the best case close to the derivative of a trajectory 
passing the same point. Thus, it is hard to imagine how the stability of the stochastic system could 
be deduced without more specific assumptions. Second, a dynamic system can escape to infinity 
along a single trajectory (say, along the diagonal) while all other trajectories end to a compact 
set. In such a case, the stochastic systems with all N could however be stable, since randomness 
forces them to deviate from the transient trajectory. 

However, these circumstances often coincide, and, as we shall see here also, proving the stability 
of the dynamical system is usually much easier than proving the stability of the Markov process. 
Therefore it is interesting to consider the large system limits together with the original random 
systems. 

All the systems studied in this paper possess differentiable large system limits, and the existence 
of a unique solution from any starting point is thus always granted. They are, however, non-linear, 
and proving their stability seems to be very hard in some cases. There are no black-box tools 
applicable in general. 

The following elementary lemma is sometimes useful when considering the asymptotic behaviour 
of a dynamical system. For completeness, a proof is given in the Appendix. 

Lemma 2.1 Let a and b be Lipschitz continuous functions [0, oo) — » (0, oo). The unique solution 
u of the differential equation 

ii t = b t - a t u t , t>0, 
with initial condition uq > is positive for every t > and satisfies 



lim inf^oo b t 
limsup^at 



< lim inf u t < lim sup u t < 

t- 100 t->oo 



limsup^&t 
liminf^oo a t 



whenever the fractions are well-defined. 
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3 Models and results 



3.1 Plain Random Contact system 

Our first and simplest model is defined in Figure 1. The number of non-seed peers with chunk 
(1) is denoted by X (Y). Peers arrive according to a Poisson process with parameter A, make a 
random contact, download whatever chunk the contacted peer has (if the seed was contacted, the 
downloaded chunk is chosen randomly), then make repeated random contacts at Poisson rate 1 un- 
til the remaining chunk is found, and leave the system. The system relies entirely on randomness, 
with fatal consequences. 



X(Y + 1) 
X + Y + 1 



(X + 1)Y 
X + Y + l 




Apf + |) 
X + Y + l 



X 




Y 



X + Y + l 



Figure 1 : Plain Random Contact system 

The large system limit of the Plain Random Contact system is the dynamical system defined by 
the non-linear ordinary differential equations 

x= {X - y)x y= {X ~ x)y (2) 
x + y x + y 

This system limit is easily seen to be unstable even close to its equilibirium x — y — A: 

Proposition 3.1 In (0, oo) 2 , the system (2) has a single equilibrium (x*,y*) = (A, A). The system 
is not stable in any environment of (x*, y*). Starting with xo > A > yo we have x t — > oo and 
y t — > 0, and vice versa. 

Proof This is immediate from the equations (2). □ 

As long as both x and y are large and roughly of the same size, the system empties rapidly. 
However, it is rather straightforward to prove that the stochastic system is unstable when A is 
larger than one. 

Remark: In this paper, we are not interested in the possible stability of the system when the 
input rate A is sufficiently low. In all our models, A appears in the large scale limits as a pure 
scaling parameter that can be as well chosen to be one. The stability of the stochastic system may, 
however, depend on A. Susitaival and Aalto [12] study by simulations several two-chunk systems 
also from the point of view of stability regions in terms of A. 
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Proposition 3.2 With A > 1, the (stochastic) Plain Random Contact system is unstable. More 
exactly, almost surely either X orY escapes to infinity whereas the other obtains ultimately only 
the values and 1. 



Proof We couple (X, Y) with a process (X, Y) such that 

(i) T>(yt>0X t >X t , Y t <Y^j >0, 

(ii) Pjhm^ooXt = oo) = 1, and 

(iii) P(y t G {0, 1} ultimately) = 1. 

Assume that A > 1 and choose a number a e (0, A — 1). Let X and F be mutually dependent, 
inhomogeneous birth and death processes with up-jump and down-jump intensities defined as 
follows, respectively: 



3\ 
2a(tV 1) 

We have 



y t > l) < E |y t } = y af e' ( *~ s) ds 



Dividing the integration interval into sub-intervals (0, 1], (1, t — 2 logt] and (t — 2 logt, £] we obtain 
the upper bound 

V - / - 2a V t-2\ogtJ 2at 

Let AT denote the counting process of the transitions of Y from 1 to 2. Inequality (3) yields that 
there are a.s. only finitely many such transitions. Indeed, 

E{A^} = E | jT af l m=1} dtj < jf afp(y t > l) dt < oo, 

since the rightmost integrand is 0(t~ 2 ). Thus, Y has the property (iii). Since 

1 f* ~ 
lim - / Y s ds = < a, 

* Jo 

we also obtain (ii). 

Now, choose MeNso large that 

A 

> a + 1, 



set X = X > M, Y = % = and define 

r = inf jt : Xt < at orF t > lj . 
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Then obviously P(r = oo) > 0, and on {r = 00} we can couple (X, Y) with (X,Y) in the 
desired way thanks to domination relations between the respective intensities. Hence, there exists 
a positive number p such that 



P 



lim X t = 00, Y t E {0, 1} ultimately 



Xq — Xq, Y — 



> p > (4) 



when x > M. By symmetry, the corresponding relation holds if X and Y are interchanged. 
Next, let R > (3/2) A and note that 

F((X t , Y t ) e [R V M, 00) x [R V M, 00) ultimately) = 0. 

Indeed, when X t A Y t > R, the total output rate of the system is larger than the total input rate A: 

2 ^ AY ^ > 2 -R>X. 



1 x t a y t + 1 " 3 



Thus, {X t A y t < i? V M} is a recurrent event. By inspecting the intensities depicted in Figure 1 
it is easy to see that the probability of moving from a state with IA7 < fiVMtoa state with 
X A Y = before a change in X V Y is bounded from below by a positive constant not depending 
on the value of X V Y. It follows that 

{X t VY t > RVM, X t AY t = 0} 

is a recurrent event as well. Now, (4) yields the proposition. □ 



3.2 Deterministic First Chunk system 

Our first attempt to overcome the spontaneous imbalance tendency of the Plain Random Contact 
system was the Deterministic Last Chunk mechanism introduced in [8], where each peer decides 
in advance (randomly) which chunk it will download as the last one. The idea was to prevent peers 
from downloading systematically the rarest chunk as the last one before leaving the system. In the 
two-chunk case, defined by Figure 2, it might be more natural to speak about the Deterministic 
First Chunk system. The number of empty peers determined to download chunk (1) first is 
denoted by A (B), while X and Y have their previous meaning. 

Although this balancing rule worked promisingly well in our flash-crowd setup with many chunks, 
the two-chunk system is probably unstable — at least its large system limit 

A ax • A by 

a = b = — (5) 

2 x + y 2 x + y 

(a — y)x . (b — x)y 

x + y ' x + y 

is unstable. 
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X(Y + 1) 
X + Y + 1 




X 



A(X + 1) 
X + Y + 1 




A 



A 
2 



{X + l)Y 
X + Y + 1 




y 



B(y + i) 

X + Y + l 







A 
2 



Figure 2: Deterministic First Chunk system 

Proposition 3.3 The system (5) has no unique equilibrium — its equilibria form the unbounded 
curve 



(a(6),b(9),x(6),y(9)) = (9, 



Moreover, it has an open set of trajectories where two components (either x and b or y and a) 
grow to infinity while the other two remain bounded. 



Proof Choose the initial values so that 

,(0)>6(0)>A, m <lx, a(0) > (X(0) 2 + y ) ( ° ))A 

Then the same relations hold for the whole paths, as seen, with help of Lemma 2.1, by writing 

x f (x + y)X 



x + y \ 2x 



- a , 



x — b 



x + y\2 2y 
y ( x ( (x + y)\ 



a — 



-(x-b) 



x + y \y V 2x 

Moreover, x and b grow toward infinity, y decreases and a approaches the value A/2. 



□ 



We don't have a proof for the stochastic case, but on the basis of similarity in behavior to the Plain 
Random Contact system (and supported by some simulations), we conjecture that this system be 
unstable as well: 
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Conjecture 3.4 With X large enough, the (stochastic) Deterministic First Chunk system is unsta- 
ble — a. s., either X orY escapes to infinity, together with B or A, respectively. 

However, we don't fix a conjecture about a Deterministic Last Chunk system with three or more 
chunks. 



3.3 Friedman system 



Consider an urn containing balls with two colors. The simplest version of Friedman's urn [4] 
works so that one repeatedly picks a random ball from the urn and returns it together with a ball 



of the opposite color. The proportions of the two colors approach 



[3]. 



We now modify the Plain Random Contact system by assuming (without bothering how this might 
be realised) that arriving peers make a random contact and then enter the system with a copy of the 
chunk that the contacted peer did not have (in the case that the seed was contacted, the downloaded 
chunk is chosen randomly); we call this 'complementary' random input 'Friedman input'. This 
system is defined by Figure 3. 



X(Y + 1) 
X + Y + l 



(X + 1)Y 
X + Y + l 



© 




Y 



X + Y + l 



A(*+j) 
X + Y + l 



Figure 3: Friedman system 

The corresponding large system limit is 

(X-x)y . (X-y)x 

x = ■ y = ■ (6) 

x+y x + y 

Proposition 3.5 The system (6) has a single equilibrium (x*, y*) = (A, A) which is globally sta- 
ble. 

Proof The equations tell immediately that with any initial state in (0, oo) 2 , both x and y converge 
mono tonic ally to (A, A). □ 

Moreover, we note that (x — A) 2 + (y — A) 2 is monotonically decreasing: 

^((x - A) 2 + {y- A) 2 ) = ((* - Xfy + (y - X) 2 x) < 0. 

This observation of a simple Lyapunov function can also be used for proving the stability of the 
stochastic Friedman system: 
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Proposition 3.6 The Friedman system defined in Figure 3 is stable for any input rate A. 



Proof Denote by N x+ and N x the counting processes of the up- and down-jumps of the process 
X, respectively, so that X t = X + f*( dN x+ - dN*~). We can then write 

xl - xl = [\(x s _ + 1) 2 - xl) divr - Apr s _ - 1) 2 - xl) dNr 

Jo Jo 

= [ (2X S _ + 1) diV; + - I (2X S _ - 1) d^-. 
Jo Jo 

(Note that N x ~ does not jump when X s _ = 0.) The compensators of N x+ and N x ~ are, respec- 
tively, 



/ H . AX- _ f 

k -Jo X s + Y s + l dt ' A > -J 



X s + Y s + 1 



dt. 



Using similar notation for the process Y, we see that the process X 2 + Y t 2 is compensated to a 
martingale by subtracting from it the process A t = f* a s dt, where 

a = ^4A(X + \){Y + ±) - 2X{X - \){Y + 1) - 2(X + 1)(Y- \)Y^ /(X + Y + 1). 

Denoting m = X A Y, M = X V Y, we have the estimate 

( 4A(M + i)( m + i) - 2M(M - -)(m + 1) - 2(M + l)m(m - -) ] 

m + 1 \ 22 2 2/ 



M + 

2M(m+ 1) / . 1 w 1 

< ^ M2A(1 + -M+- 

- M + m + 1 V 2M ; 2 

< ^(3A+^-M) 

1 

< — 

4 

on the set {M > 3A + 1}. Denote r = inf {t : M t < 3A + 1}. Assume now that M = X VF > 
3 A + 1. Then, the process B t = X 2 At + Y t 2 Ar is a non-negative supermartingale and thus converges 
to an integrable limit satisfying E {-Boo} < E {£>o}> an d B t —A tAT is a martingale (for integrability, 
note that B t is dominated by (X + Y + N x+ + A^ + ) 2 , where N x+ + N y+ is a Poisson process). 
Since A tAT is non-increasing, we have 

B > B Q - lim E {B t } = - lim E {A tAr } > E j / ^ dtl = \e {t} . 

t^oo t^oo J 4 J 4 

Thus, the finite set {(x, y) G N 2 : rr V y < 3A + 1} is reached from any fixed initial state outside 
of it in a time having finite expectation. □ 
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3.4 Delayed Friedman system 



Our first distributed implementation of the idea of the Friedman system (see [11]) was the system 
defined by Figure 4. An arriving peer first makes a random contact and then decides to download 
first the chunk that the contacted peer does not have (in the case that the seed was contacted, 
the peer decides randomly). As with the Deterministic First Chunk system, the number of empty 
peers determined to download chunk (1) first is denoted by A (B), while X and Y have the same 
meaning as before. We call this the Delayed Friedman system, because the subsystem (X, Y) 
obtains Friedman input with stochastic delay. 



X(Y + 1) 
X + Y + 1 



(X + 1)Y 
X + Y + l 




X 




Y 



A(X + 1) 
X + Y + 1 



B(Y + 1) 
X + Y + l 




X(Y 



X + Y + l 



A 



© 



X + Y + l 



Figure 4: Delayed Friedman system. 

The corresponding large system limit is the dynamical system 

Ay — ax • \x — by 
a = — b = (7) 

x + y x + y 

(a-y)x . (b-x)y 

x = y = . 

x+y x+y 

This system is difficult to analyse, because it oscillates heavily around its unique equilibrium 
(a*, b*,x*, y*) = (A, A, A, A). The 'logic' of the oscillating system evolution from an imbalanced 
state, depicted in Figure 5, is the following: 

• there is hardly any input to nor output from y 

• the 'Friedman rule' directs input to a, which accumulates almost all of it, since x is negligi- 
ble 

• when enough mass has accumulated to a, the balance starts to improve 

• once x has become macroscopic, a and y empty rapidly 
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• b has not had time to grow, so we get a situation close to the mirror image of the original. 




20 40 60 80 100 



Figure 5: A trajectory of the large system limit (7) of the Delayed Friedman system with initial 
values a = b = 0, x = 0.1, y = 9.9. Colors: a: blue, b: red, x: green, y: brown. 

Numerical experiments like that shown in Figure 6 (left) hint to global stability, but we have not 
found an explicit Lyapunov function or other means to prove this. As regards local behavior 
near equilibrium, the linearised system at equilibrium is essentially a two-dimensional harmonic 
oscillator — the two remaining dimensions correspond to negative eigenvalues. If we continue 
the numerical computation of a trajectory of (7), we see slow convergence toward equilibrium 
(Figure 6, right; by the way, it is interesting to observe that x + y seems to never descend below 
2). Indeed, the system (7) turns out to be locally asymptotically stable, and this can be shown in a 
non-elementary but basically straightforward way through a center manifold analysis (see [2, 6]). 




Figure 6: The behaviour of x + y, initial state as in Figure 5. Left: t G [0, 1000]. Right: t G 
[1000,2000]. 

Proposition 3.7 The system defined by (7) is locally asymptotically stable. If the starting point 
(do, bo, x , y ) is close enough to the equilibrium (A, A, A, A), the distance to it decreases propor- 
tionally to 1/ y/i. 
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Proof Since the common denominator of the right hand sides of (7) does not affect the trajecto- 
ries, and since A is a pure scaling factor, it is sufficient to consider the system 



x 4 


— X\ £3 










— X2 X 4 


with equilibrium 


P = 


' 1 
1 
1 


(xi 


- x 4 ) x 3 






1 


_(x 2 


- x 3 ) x 4 _ 









At p, the linearized system is given by the matrix 



A = 



-1 





-1 


1 





-1 


1 


-1 


1 








-1 





1 


-1 






This has eigenvalues Ai 2 = ±2 and A3 4 = — 1 . The latter has just one linearly independent 
eigenvector. Let V\ and v 2 be the real and imaginary parts of an eigenvector corresponding to 
Ai = % and let v 3 be an eigenvector for A 3 = — 1 and v 4 such that Av 4 + v 4 = v 3 . Putting 
these into matrix V we obtain the similarity transformation 



V~ 1 AV = 






1 





" 






' 1 


1 





1 


-1 













V = 


-1 


-1 





1 




with 














-1 


1 






-1 





1 














-1 






1 





1 






Thus, the system has a two-dimensional stable manifold W s and a two-dimensional center man- 
ifold W c . The corresponding columns of V are tangents to these at p (see e.g. [2]), and the 
manifolds are invariant under the flow of the system. Trajectories close to p approach the center 
manifold like t e~ l . By the reduction principle (see [6]), in order to study the stability of the 
system, it suffices to know how it behaves on the local center manifold. For this we will reduce 
the system to a normal form (see [6]). 

Take first a linear change of coordinates: 

x = p + V [2] , where u,v eR 2 . 

Then the local center manifold can be expressed as 

W c = {p + V [fa]: \\u\\<d} , 

where h(u) is a two-vector with h(0,0) = and Dh(0, 0) = (the latter because of tangency). 
Write [ j.1 ] = V^ 1 f . Then the requirement of invariance of W c amounts to 

fip + V [t&)\) = Dh{u)rip + V [,&]). 
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This is the equation for h . We can recursively solve the coefficients of the Taylor expansion of 
h . Note that the constant and linear terms are zero. This way we get: 



h(u) = § 



5u 2 + U1U2 — U 2 

-hu\ — \Auiu 2 — 2«2 



+ o(H| 3 ) 



and 



g(u) := f c (p + V [i^)]) = 



-u 2 4~ ( — 10 u\ + 8 u\u 2 - 2 uiu 2 + u% )/9 

Mi 



+ o(H| 4 ). 



The behaviour of the system on the center manifold is given by equation u' = g(u) . Following 
[6] we change to complex variables. An eigenvector of the linear part of g corresponding to the 
eigenvalue % is q — [{] . Setting u = zq + zq we get z = (u 2 — iu±)/2 and 



t = iz+ ^f + !^fz + ^z? + =y&? + 0(\z\' k ). 



32- lit 



-8+7i ^3 



Taking substitution z = w + fa w 3 + f3 2 w 2 w + f3\ w w 2 + @ w 3 we obtain 

w ' = ^ + (^-2/5 3 ) W 3 -^ W 2 «; + (^ + 22A) W w 2 + (^+42/5o)^ 3 + 0(| W | 4 ). 

We see that we can kill other third order terms except the w 2 w -term. Hence choosing 



o _ 7-8i n _ ll+32i a _ n o _ 7^81 



we obtain the equation 
Now, since 

d \ 1 2 / • 
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|w| 2 w + 0(|w| 4 ) . 



32+ lit 
18 



u>| 2 w) W + W {—1 W 



32- lit 
18 



|H w) + 0(\w\°) = -f \w\* + 0(\wf) , 



we see that w — > like l/y/i. Hence the system is locally asymptotically stable. 



□ 



If the third order terms of the equation of w would not have determined the stability, we should 
have gone to higher order expansions. 

Our simulations showed that also the original stochastic system oscillates strongly, with a roughly 
constant amplitude depending on A, even when it is started from a balanced state. The simulations 
suggested that the stochastic system might be stable, but we have not found a way to prove or 
disprove this. From a practical viewpoint, the oscillations indicate that the system is not in good 
balance, which could be fatal in an application with rapidly changing arrival rates and flash crowd 
scenarios. 
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3.5 Enforced Friedman system 



The reason of the oscillatory behaviour of the Delayed Friedman system was that when a peer 
finally succeeds in downloading its first chunk according to the 'Friedman rule', applied possibly 
long ago, the choice may already be outdated and thus counterproductive. Our last model avoids 
this problem by realising choices only immediately or not at all. An empty peer makes three 
contacts simultaneously (sampling with replacement; the seed shows a random chunk). If two of 
the chunks of the contacted peers differ from the third one, the latter is downloaded. If all three 
chunks are similar, nothing is done, but the empty peer stays in a waiting room and repeats the 
triple contact operation after Exp(l) -distributed waiting time. The number of peers in the waiting 
room is denoted by Z. Note that if the experiment was successful (that is, a 2-1 situation was 
obtained), the probability of downloading chunk is (Y + \)/{X + Y + 1) — exactly the same 
as in the Friedman system! Therefore we call this system, defined by Figure 7, the Enforced 
Friedman system. 



X(Y + 1) 
X + Y + l 




X 



3{X + \){Y+\fZ 
{X + Y + l) 3 




(X + 1)Y 
X + Y + l 




Y 



?>{x + \y{Y + \)z 

{X + Y + If 



A 



Figure 7: Enforced Friedman system. 
The large system limit of the Enforced Friedman system is 



i = A-J^ (8, 
{x + yy 

3xy 2 z xy 
{x + yy x + y 

3x 2 yz xy 

y 



(x + y) 3 x + y 

This is trickier than the plain Friedman system, but nevertheless tractable: 
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Proposition 3.8 The system (8) has in (0, oo) x (0, oo) x (0, oo) a single equilibrium point 
(z*,x*,y*) = (|A, A, A). Locally, the equilibrium is a sink, i.e. all eigenvalues of the limiting 
linear system are real and negative. This equilibrium is globally asymptotically stable. 



Proof The uniqueness and local character of the equilibrium are found by easy computations. 
It remains to prove the global stability. Since A is a pure scaling parameter, we can choose A = 1. 
Let us change to the variables 



p = x + y, (3 = 



x-y 

x + y 



which lose the differentiation between x and y in favour of a single imbalance characteristic (3. 
The new system satisfies the equations 

z = 1-^(1-P)z (10) 
p = l(l-/3)(3z-2p) (11) 

$ = -0(1 - P) (j- ~ l) ■ (12) 

Fix arbitrary initial values z > 0, p > 0, (3q E (0, 1) (J5q = gives a simple linear system). 

1°. Note first that the system (z t , pt, fit) cannot escape from the set (0, oo) x (0, oo) x [0, 1) in finite 
time. Denote r = inf {t > : (3 t = 1}. For t G [0, r), z t > whenever z t < 4/3, and p t > 
whenever p t < (3/2)z t . Since the system freezes at r, it follows that inf t > z t > mm(z , 4/3) and 
inf t > p t > 0. Equation (12) then yields that f3 cannot reach 1 at any finite time, so r = oo. 

2°. A basic observation from the equations (9) is that the absolute value of x — y is non-increasing. 
In terms of the variables z,p,/3 this means that f3p 2 is non-increasing, and the corresponding 
equation from which this is seen reads 

^(M 2 ) = ~A(i-AK (13) 

By point 1°, we get 

poo 

lim / [3 s (l-f3 s )z s ds = 0. (14) 

Assume that f3 t — > 1. Since @ t Pt is decreasing by (13), p t cannot be ultimately increasing, and it 
approaches some finite limit value. On the other hand, Lemma 2.1, applied to (10), yields with 
Pt — > 1 that z t — > oo, and (11) makes p t ultimately increasing. This contradiction shows that 
w := 1 — liminf^oo f3 t > 0. 

Since we already saw that ultimately z t > 1, we obtain by (14) and (12) (neglecting the term 3z/p 
that for any e > there exists a number t such that 

poo nT 

e> / P s (l-P s )z s ds> / p s ds = PT-Pt (15) 
Jt Jt 
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for any t and T such that T > t > t . Choosing e = w/2 we deduce that limsup^^ (3 t < 1. But 
then, (14) and (15) imply lim^oo f3 t = 0. 

3°. Now, Lemma 2.1 yields that 2^ and p t converge to their stable values. □ 

In simulations, this scheme seems to work in a perfect way, and it is also capable to deal with flash 
crowds and rapid input rate variations as well (say, when A is large for a while and then suddenly 
drops). We also have good intuitive grounds to believe that the stochastic Enforced Friedman 
system be stable: as noted earlier, the output from the waiting room is pure Friedman input to 
the subsystem (X, Y), just with a randomly varying rate. Since the Friedman system is provably 
stable with any input rate A, the same can be expected from the Enforced Friedman system: 

Conjecture 3.9 The (stochastic) Enforced Friedman system is stable with any input rate A. 

4 Concluding remarks 

Although we could so far prove only some of the stability /non- stability properties that seem to 
hold for our models, some preliminary conclusions can already be drawn. First, we showed that 
it is possible to create effectively stabilizing 'Friedman input' using a fully distributed algorithm 
based on random contacts. Second, one should use a 'waiting room' concept with memory less 
re-trials rather than launch an active hunt for a rare chunk — the rarity may dissolve during the 
hunt, leading to oscillations. Third, the extra delay caused by the waiting room stage is relatively 
small in the mean, certainly worth its price. We also believe that the idea of Enforced Friedman 
input has usable counterparts in many-chunk systems, and we work currently on one promising 
candidate. 

On the other hand, note that our models lack realism as regards the modelling of two aspects of 
the system: search and downloading. Following [7], the slowness of downloading is modelled 
by the Exp(l)-distributed inter-contact times, whereas the contacts themselves are assumed to be 
instant. However, unsuccessful contacts could be repeated much faster than those leading to a 
chunk download. Assessing the significance of this simplification remains for further work. 

Acknowledgement. We thank Balakrishna Prabhu, Rudesindo Nunez Queija, Philippe Robert, 
Florian Simatos (members of the Euro-FGI SCALP project) and Lasse Leskela for fruitful discus- 
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A Appendix 

Proof of Lemma 2. 1 . 

Denote r = inf {t > : u t = 0}. Since u = b > in the case that u = 0, we always have 
r > 0. For t e [0, r) we have 

ii t > inf b s — ( sup a s )u t . 
se[o,r) se[o,r) 

It follows that 

u t > infse[ °^ + (tie - infse[Mbs ) exp(-( sup a s )t), t e [0, r). 

SU Pse[0,r)°s SU PsG[0,r) °s se[0,r) 

Since the right hand side is positive also for every t = r, we deduce that r = oo. 

Assume first that the functions a and b are bounded away from zero and infinity. Let a* , a* , 6* , b* 
be any positive numbers such that 

a* < liminf a t < limsupa t < a*, 6* < liminf b t < Urn sup b t < b* . 
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Denote 



a = inf \ t > : inf a s > a*, sup a s < a*, inf b s > 5*, sup b s < b* 

se[t,oo) se[t,oo) se[t,co) se[t,oo) 



Since a is finite and 



inf b s - ( sup a s )u a+t < u a+t < sup b s - ( inf a s )u a+t , t > 0, 

se[(J,oo) se[a,oo) se[cr,oo) sG[ct,oo) 

we obtain that 

^ + K - ^K a ^ < < — + K - -K a *', t > o, 

a* a* a* a* 

which implies 

— < liminfwi < limsupw t < — . 

a* t^oo j^oo a* 

Since a*, a*, 6*, 6* were arbitrary, the assertion follows. 

Finally, when a and 6 are not both bounded away from zero and infinity and the fractions in 
the assertion are well-defined, the non-trivial cases are obtained as above, simply relaxing some 
conditions in the definition of a. □ 
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